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LOG -STRUCTURED ARRAY 

Field of the Invention 

The present invention relates generally to a method 
5 and system of adding storage devices to a log- structured 

array storage system. 

Background of the Invention 

Many current mid to high-end data processing systems 

10 employ storage systems that comprise arrays of small form 

factor direct access storage devices such as magnetic disk 
drives. Such arrays are designed to emulate large form 
factor devices while offering the advantages of lower cost, 
smaller size, improved performance and reduced power 

15 consumption. A number of alternative array architectures 

have been developed to achieve good overall array 
reliability; one of which is RAID (Redundant Array of 
Independent Disks) defined by the RAID advisory board and 
which includes a number of variants (commonly called 

20 'levels') . One commonly used variant is RAID 5 in which a 

data stripe consists of a number of data strips and a 
parity strip, each strip being stored on one member disk of 
the array. In RAID 5 the parity strips of different data 
stripes are distributed across the member disks. Further 

25 details of RAID 5 and the other variants may be found in 

the 'RAID book' (ISBN 1-57398-028-5) . 
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A more recently developed array architecture is the 
Log Structured Array (LSA) in which the Log Structured 
Filesystem (LSF) approach, pioneered at the University of 
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California Berkeley, is combined with RAID. Examples of 
systems employing the LSA architecture are described in US 
Patents 5,124,987 and 5,671,390. LSA is most often 
described as being implemented using RAID 5 techniques. 

5 

An LSA consists of a disk controller and N+l physical 
disks. In an LSA, data is stored on disks in compressed 
form. After a piece of data is updated, it may not 
compress to the same size as it did before it was updated, 

10 so it may not fit back into the space that had been 

allocated for it before the update. The implication is 
that there can no longer be fixed, static locations for all 
the data. An LSA controller manages information storage to 
write updated data into new disk locations rather than 

15 writing new data in place. Therefore, the LSA must keep a 

directory which it uses to locate data items in the array. 

As an illustration of the N+l physical disks, an LSA 
can be considered as consisting of a group of disk drive 

20 DASDs, each of which includes multiple disk platters 

stacked into a column. Each disk is divided into large 
consecutive areas called segment -columns . A segment -column 
is typically as large as a physical cylinder on a physical 
disk. Corresponding segment -columns from the N+l disks 

25 constitute a segment. The array has as many segments as 

there are segment -columns on a disk in the array. One of 
the segment -columns of a segment contains the parity 
(exclusive-OR) of the remaining segment -columns of the 
segment. For performance reasons, the parity 
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segment -columns are not all on the same disk, but are 
rotated among the disks. 

Logical devices are mapped and stored in the LSA. A 
5 logical track is stored, as a set of compressed records, 

entirely within some segment of the LSA; many logical 
tracks can be stored in the same segment. The location of 
a logical track in an LSA changes over time. A directory, 
called the LSA directory, indicates the current location of 
10 each logical track. The LSA directory is maintained 

either in Non-Volatile Storage (NVS) or paged virtual 
memory in the disk controller. 

Reading and writing into an LSA occurs under 
15 management of the LSA controller. An LSA controller can 

include resident microcode that emulates logical devices 
such as direct access storage device (DASD) disk drives, or 
tape drives. In this way, the physical nature of the 
external storage subsystem can be transparent to the 
20 operating system and to the applications executing on the 

computer processor accessing the LSA. Thus, read and write 
commands sent by the computer processor to the external 
information storage system would be interpreted by the LSA 
controller and mapped to the appropriate disk storage 
25 locations in a manner not known to the computer processor. 

This comprises a mapping of the LSA logical devices onto 
the actual disks of the LSA. 

A write received from the host system is first written 
30 into a non-volatile cache and the host is immediately 
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notified that the write is done. The fraction of cache 
occupied by modified logical tracks is monitored by the 
controller. When this fraction exceeds some threshold, 
some number of modified logical tracks are moved 
5 (logically) to a memory segment, from where they get 

written (destaged) to disk. The memory segment is a 
section of controller memory, logically organized as N+l 
segment -columns called memory segment - columns ; N data 
memory segment -columns and 1 parity memory segment -column . 

10 When all or part of a logical track is selected from the 

NVS, the entire logical track is written into one of the N 
data memory segment - columns . When all data memory 
segment -columns are full, an XOR operation is applied to 
all the data memory segment -columns to create the parity 

15 memory segment -column, then all N+l memory segment -columns 

are written to an empty segment on the disk array. It is 
important for the performance of LSA that a segment is an 
integral number of RAID 5 stripes so that parity can be 
calculated over the destaged data without reading the old 

20 data and parity and so incurring the RAID 5 write penalty. 

All logical tracks that were just written to disk from 
the memory segment must have their entries in the LSA 
directory updated to reflect their new disk locations. If 
25 these logical tracks had been written before by the system, 

the LSA directory would have contained their previous 
physical disk locations; otherwise the LSA directory would 
have indicated that the logical track had never been 
written, and consequently has no address. Note that 
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writing to the disk is more efficient in LSA than in 
RAID- 5, where 4 disk accesses are needed for an update. 

In Log Structured Arrays, data to be written is 
5 grouped together into relatively large blocks (the 

segments) which are written out as a unit in a convenient 
free segment location on disk. When data is written, the 
previous disk locations of the data become free creating 
"holes" of unused data (or garbage) in the segments on 

10 disk. Eventually the disk fills up with segments and it is 

necessary to create free segment locations by reading 
source segments with holes and compacting their 
still-in-use content into a lesser number of destination 
segments without holes. This process is called free space 

15 or garbage collection. 

When the user of a non-LSA RAID array wishes to expand 
the storage capacity of the array, one option is to add one 
or more new member disk drives to the array. In RAID 5 

2 0 arrays, the problem arises as to how to remap the data 

and/or parity whilst adding one or more disks. Furthermore 
a number of characteristics are important for any RAID 
array expansion scheme : (i) host IOs should not be held 
off for long periods of time; (ii) the amount of 

25 data/parity that needs to be moved should be minimised; and 

(iii) the translation of logical address to physical 
address should be kept as simple as possible since this 
calculation must be performed for every array 10 access. A 
number of patents including US patents 5502836, 5524204, 
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5615352 and 5758118 address the aforementioned problem and 
seek to trade off these desirable characteristics. 

Expansion capability for log structured arrays is also 
5 desirable. One relatively straight forward way of doing 

this would be to add a complete new RAID array to the 
log- structured array, as a log structured array may consist 
of one or more RAID arrays. However, the LSA user may not 
need all the additional storage capacity provided by an 

10 additional RAID array and therefore it would often be 

preferable to increase storage capacity by adding one or 
more member disk drives to a RAID array which is already a 
member of an LSA. However, a number of additional problems 
arise in adding new members to a RAID array that forms part 

15 of an LSA, none of which are addressed in the prior art. 

These are: (i) LSA performance is tied to the ability to 
perform full stripe writes for destaging segment sized IOs 
to the array. Thus the logical block addresses of the 
blocks in a stripe after the array expansion must be 

20 sequential; (ii) since the LSA segment size is tied to the 

array stripe size, if this changes, the segment size for 
new destages must change to match; and (iii) since LSA 
obtains new segments for filling by employing a garbage 
collection algorithm over previously destaged segments, it 

25 is convenient that newly destaged segments are the same 

size as pre-existing segments. 
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It would be desirable to provide an improved 
expansion/contraction scheme for an information storage 
system configured as a log structured array. 
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Summary of the Invention 

According to a first aspect of the invention there is 
provided a log structured array (LSA) controller apparatus 
for controlling the transfer of information between a 
5 processor and a plurality of information storage devices 

configured as an N+l array in which the information is 
stored as stripes extending across the devices of the 
array, each stripe comprising N information strips and one 
parity strip, each information strip storing an integer 

10 number of logical tracks; the controller further defining 

an LSA directory which specifies the location of each 
logical track in terms of the ID of the stripe to which the 
logical track belongs and the offset of the logical track 
within the stripe; wherein on the addition of an 

15 information storage device to the array, the additional 

strip provided for each stripe by the storage device is 
logically appended to the end of each stripe in the LSA 
directory. 

2 0 According to second aspect of the invention there is a 

provided a method of adding an information storage device 
to a plurality of information storage devices in an 
information processing system in which a processor is 
connected for communication with the information storage 

25 devices by means of a log structured array controller, the 

plurality of information storage devices being configured 
as an N+l array in which the information is stored as 
stripes extending across the devices of the array, each 
stripe comprising N information strips and one parity 

30 strip, each information strip storing an integer number of 
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logical tracks, the controller further defining an LSA 
directory which specifies the location of each logical 
track in terms of the ID of the stripe to which the logical 
track belongs and the offset of the logical track within 
5 the stripe, the method comprising connecting the additional 

information storage device to the log- structured array 
controller and logically appending the additional strip, 
provided to each existing stripe by the additional storage 
device, to the end of each stripe in the LSA directory. 

10 

The present invention is thus predicated on the LSA 
directory defining the location of a logical track within 
the array using the construct {stripe number , of f set } rather 
than the traditional construct {logical block address}. In 
15 this way, the additional strip provided by the additional 

storage device for each existing stripe is logically 
appended to the end of the existing stripes and the 
addresses previously stored in the LSA are still correct. 

2 0 In a preferred embodiment, for array configurations 

where prior to the addition of the additional storage 
device the parity strips are rotated amongst the N+l 
information storage devices in accordance with the RAID- 5 
architecture, the method further comprises moving selected 

25 parity strips to the additional information storage device 

at locations that would have stored parity strips had the 
array originally comprised N+2 information storage devices. 
In this way skew in the load on the storage devices is 
avoided . 
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A preferred embodiment of the present invention will 
now be described, by way of example only, with reference to 
the accompanying drawings . 

5 
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Brief Description of the Drawings 

Figure 1 is a representation of a computer system 
constructed in accordance with an embodiment of the present 
invention: 

5 Figure 2 is a tabular representation of an array of 

3+P disk storage devices prior to the addition of the extra 
disk storage device; 

Figure 3 is a tabular representation of an expanded 
array prior to transfer of information to the extra disk 
10 storage device; and 

Figure 4 is a tabular representation of an expanded 
array in which parity strips have been moved to the 
additional disk storage device in accordance with one 
embodiment of the present invention. 
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Detailed Description of the Preferred Embodiment 

Figure 1 shows a preferred embodiment of a computer 
system 100 constructed in accordance with the present 
invention. The system 100 includes a processor 102 or host 
5 computer that communicates with an external information 

storage system 104 having N+l direct access storage devices 
(DASD) in the form of disks in which information is 
maintained as a log structured array (LSA) . In accordance 
with the RAID 5 architecture, each disk is logically 

10 divided into large consecutive areas called segment -columns 

where a segment -column is typically as large as a physical 
cylinder on a physical disk. Corresponding segment -columns 
from the N+l disks constitute a segment. The array has as 
many segments as there are segment -columns on a disk in the 

15 array. One of the segment -columns of a segment contains 

the parity (exclusive-OR) of the remaining segment -columns 
of the segment. For performance reasons, the parity 
segment -columns are not all on the same disk, but are 
rotated among the disks. 

20 

In Figure 1, an array 106 comprising four disks 106a, 
106b, 106c, and 106d is shown for illustration, but it 
should be understood that the array may include a greater 
or lesser number of disks. A control unit 108 controls the 

25 storage of information so that the array 106 is maintained 

as an LSA. Thus, the DASD recording area is divided into 
multiple segment -column areas and all like segment -columns 
from all the disks comprise one segment's worth of data. 
The control unit 108 manages the transfer of data to and 

30 from the array 106 so that periodically it considers 
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segments for free space and selects target segments 
according to a fitness function described in detail below. 

The processor 102 includes (not illustrated) : one or 
5 more central processor units, such as a microprocessor, to 

execute programming instructions; random access memory 
(RAM) to contain application program instructions, system 
program instructions, and data; and an input /output 
controller to respond to read and write requests from 

10 executing applications. The processor 102 may be coupled 

to local DASD (not illustrated) in addition to being 
coupled to the LSA 104. Typically, an application program 
executing in the processor 102 may generate a request to 
read or write data, which causes the operating system of 

15 the processor to issue a read or write request, 

respectively, to the LSA control unit 108. 

When the processor 102 issues a read or write request, 
the request is sent from the processor to the control unit 

20 108 over a data bus 110 and is received in the control unit 

by a controller 112. In response, the controller produces 
control signals and provides them over a controller data 
path 114 to an LSA directory 116 and thereby determines 
where in the LSA the data is located, either in a 

25 non-volatile LSA data cache 118 or in the DASD 106. The 

LSA controller 112 comprises one or more microprocessors 
with sufficient RAM to store programming instructions for 
interpreting read and write requests and for managing the 
LSA 104 in accordance with the present invention. 
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Data is transferred between the processor 102 and the 
LSA 104 during read operations over a path including a read 
data path 120, DASD access circuits 122, the LSA data cache 
118, controller access circuits 124, the controller data 
5 path 114, the controller 112, and the data bus 110. Data 

is transferred during write operations over a path 
including the data bus 110, the controller 112, the 
controller data path 114, the controller access circuits 
124, the LSA data cache 118, the DASD access circuits 122, 
10 a segment data path 12 6, an accumulating memory segment 

input write buffer 128, and a DASD write path 130. 

The data cache 118 permits delay of write operations 
on modified data logical tracks to the memory segment 12 8 

15 for purposes of maintaining seek affinity. More 

particularly, if write operations to adjacent logical 
tracks are received, then all modified data in logically 
adjacent tracks will be moved into the memory segment 128 
at the same time so they are stored in the same 

20 segment -column. This helps keep together logical tracks 

that are adjacent in the data cache so they will be 
adjacent when moved into the DASD array 106, thereby 
preserving seek affinity. The advantages and operation of 
the data cache 118 are described in greater detail in U.S. 

25 Pat. No. 5,551,003 issued Aug. 27, 1996 and assigned to 

International Business Machines Corporation. 
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Preferably, the LSA data cache 118 is managed as a 
least-recently-used (LRU) cache, so that data is queued in 
the cache, with the most recently stored data at the top or 
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(front) of the queue. In particular, the LSA data cache 
118 is organized with clean data tracks in one LRU list and 
dirty tracks in another LRU list. The clean LRU list 
specifies logical tracks containing information wherein the 
5 data in the LSA cache is the same as the data in the DASD 

array, and the dirty LRU list specifies logical tracks 
containing modified data wherein data is different from the 
data in the DASD array. 

10 A basic operation of the storage system 104 is to 

write a particular logical track so as to change the 
contents of the logical track. In general, such live data 
tracks are first placed in the non-volatile data cache 
memory 118 of the LSA control unit 108. When the fraction 

15 of the cache occupied by modified logical tracks exceeds a 

predetermined value, the controller 112 logically moves a 
set number of modified tracks to the memory segment 128 by 
assigning them there. After one segment's worth of live 
tracks are moved into the memory segment, the tracks are 

20 written into contiguous locations of the DASD array 106. 

It should be understood that the operation of the data 
cache 118 is transparent to the processor 102 and therefore 
some operations of the storage system 104 will be described 
from the perspective of the processor, without reference to 

25 the data cache. Although the inclusion of a data cache 118 

as described above can improve the overall performance of 
an LSA system, it should be understood that the inclusion 
of a data cache and the details of its implementation are 
not essential to the invention. 
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The smallest unit of data that can be addressed by the 
LSA directory 116 is called a logical track. If the 
processor 102 writes data comprising part of a track, the 
LSA control unit 108 must read the remainder of the logical 
5 track from the DASD array 106 making reference to the 

current location of the logical track stored in the LSA 
directory 116 before writing the updated complete track 
into the memory segment buffer 128. A variable number of 
compressed logical tracks comprise a segment (depending 

10 upon how well the logical tracks compress) . At any time, a 

logical track is live, or current, in only one segment. In 
all other segments, the logical track is outdated, also 
referred to as being a dead track. From the perspective of 
the processor 102, a live data track is initially stored 

15 into controller memory (such as the data cache 118 or the 

input memory segment write buffer 12 8) comprising a segment 
SEG 0 that initially is empty. That is, the segment SEG 0 
resides in the controller memory as the segment is filled. 

20 If a logical track k is being written into the segment 

SEG 0 of controller memory and if the logical track k was 
previously live in some other DASD segment SEG in the DASD 
106 before the write operation, then the track k becomes 
dead in the segment SEG and becomes live in the controller 

25 segment SEG 0 being filled. This continues until the segment 

SEG 0 in the LSA controller memory is filled to capacity, at 
which time the segment SEG 0 is destaged, i.e. it is moved 

* 

from the memory segment buffer 12 8 and written to the DASD 
array 106. Another segment's worth of data is then filled 
3 0 in the controller memory and the process repeats until the 
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next destage operation. In practice for performance reasons 
this logic may be duplicated resulting in multiple memory 
segment buffers 128 being filled concurrently. This 
concurrency does not change the basic principle of 
5 operation. 

As discussed above, it may become desirable to 
increase the overall storage capacity of the LSA. The 
present invention provides a technique whereby one or more 
10 DASDs can be added to the array. In the following 

description there will be detailed the process for adding a 
single additional DASD to an existing array. However the 
techniques are applicable to the task of adding more than 
one new disk at a time. 

15 

In the present invention, the LSA directory describes 
the location of a track within a specific RAID array by 
using the construct {stripe number, offset} rather than 
simply {logical block address} . To use the technique and 

2 0 system of the present invention, it is required to employ 

an arrangement wherein a LSA segment maps to exactly one 
RAID stripe. Thus, a RAID strip and an LSA Segment column 
are the same thing and the block offset into a segment is 
the same as the block offset into the RAID stripe. It can 

2 5 be shown that this is the optimal arrangement for LSA in 

any case. These two predicates allow for the size of a 
segment to increase and potentially decrease (see the 
following discussion) without affecting the contents of the 
existing LSA directory. 
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With reference now to Fig. 2 there is shown in tabular 
form a 3+P RAID 5 array with a strip size of S logical 
blocks in which the data strips are numbered DO Dl D2 etc. 
and the parity strips are numbered PO PI P2 etc. In this 
5 arrangement, each data strip starts at an array LBA given 

by its strip number (DO, Dl etc.) multiplied by S. Thus 
data strip D2 starts at LBA 2*S. 

It will be noted in passing that many RAID 5 
10 implementations rotate the first strip in a stripe around 

the disks in the same way that that the parity rotates. 
This is usually done to improve the performance of 
sequential reads. Since sequential reads at the disk level 
are rare in LSA, this feature is not as attractive. The 
15 arrangement shown above was chosen for LSA in the present 

invention because it reduces the number of data/parity 
moves when a disk is added. 

In the current invention, the procedure for adding a 
20 disk to the RAID array is as follows: 

1. The new disk is initialised to all binary 00s so that it 
can be included in the parity calculations without 
modifying the parity already on disk. 

25 

2 . Accesses to the RAID array are temporarily suspended and 
any data cached by RAID 5 array location is flushed from 
the cache . 
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3. The new disk (disk 4) is added as a member of the RAID 
array. The arrangement at this point in the process is as 
shown in Fig . 3 . 

5 4. There are now several algorithms that can be applied to 

optionally relocate the parity and/or the data. 

In the preferred embodiment, the data and parity are 
migrated from the arrangement of Fig . 2 to the arrangement 

10 of Fig. 3 such, that the data and parity are moved to the 

position that they would have occupied had the array 
originally been created with 4 disks. In the preferred 
embodiment this migration happens gradually, alongside and 
as part of the normal 10 workload of the disks with little 

15 impact. The algorithm for this task is as follows: 

A bitmap is initialised which has a single bit to 
represent each stripe in the array. The bit value indicates 
whether the data and parity in the strip are in the 
20 position shown in Fig. 2 or the position shown in Fig. 3. 

At this point, IO accesses to the array can be 
enabled. The algorithm for servicing them is as follows: 

25 Reads/Cache stages: If the strip being accessed is one 

for which the position of the data is different in Fig. 3 
from the position in Fig. 2 then the bitmap is examined in 
order to determine which disk to access. 
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Destage operations are much simpler since these always 
occur as full stripe writes. In this case the data and 
parity are written at the position shown in Fig. 3 and the 
bitmap is updated if necessary to show that the data and 
5 parity have been moved. 

In an alternative arrangement, a background process is 
set up to migrate the data and parity from Fig. 2 format to 
Fig. 3 format. This is not preferred since it is much 

10 easier to migrate the data and parity between the formats 

during a full stripe write (no locking or reading needs to 
be performed) and because the skewing of the load on the 
disk has only very slight impact for the read only workload 
which is applied to the disks between segment destages in 

15 LSA. 

If the data and parity is relocated by a background 
process then it should be noted that the amount of parity 
and data which need to be moved is much less than 

2 0 (approximately half) that which has to be moved in schemes 

of the prior art which maintain optimal parity rotation and 
keep full strips sequential on disk. For example, in stripe 
0 no movement at all needs to take place since the position 
of the data strips has not changed and we pre- initialised 

25 disk 4 with zeros so it can be used as the parity. In 

stripe 1 only PI and D5 need to be swapped, leaving D3 and 
D4 where they were . 



The benefit of moving the parity to the Fig. 3 format 
30 is that the optimal rotation of parity on the disks is 
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maintained. This is traded off against the cost of 
relocating the parity. 

In an alternative embodiment the parity is not moved, 
5 rather it is maintained in the format shown in Fig. 2. The 

benefit of this approach is that no parity movement must 
take place but this is at the cost of slightly skewing the 
load on the disks. A second alternative would be to swap 
the location of a parity strip and a single data strip on 

10 each stripe to achieve the optimal rotation of parity on 

the disks, but without keeping a sequential ordering of 
data strips across the disks. This would further reduce the 
amount of parity and data which need to be moved but would 
make the calculation to determine the location of data 

15 strips more complex. 

5. After adding the new drive to the array, each stripe now 
has an unused strip which is located logically as the last 
strip in the stripe. 

20 

The appending of an extra strip onto the end of each 
stripe would not be viable if the addresses in the LSA 
directory were based upon logical block address. The 
addition of an extra strip into each stripe has changed the 
25 logical block address of every block of data after the 

first stripe in the array. Since in accordance with the 
preferred embodiment, the address stored in the LSA 
directory is of the form stripe {number , of f set } and the new 
strip is logically appended at the end of the existing 
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stripes, the addresses stored in the LSA directory are 
still correct. 

All that has happened therefore is that each 
5 pre-existing segment has been expanded by the addition of 

some unused space at the end of the segment. This newly 
added space will become available for new destages as 
segments are garbage collected during the normal operation 
of the LSA . 

10 

It is also possible to run this procedure in reverse 
to remove a disk from a RAID array LSA garbage 
collection is used to shrink every segment on the RAID 
array such that it no longer occupies the last strip in the 
15 stripe. As each segment is written back to the array, the 

strip on disk 4 is filled with zeros and the parity is 
written in the format shown in Fig. 2. 

What is claimed is: 

20 
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